167 results found.
Written
Evaluation Tool,
Language Type:
Multilingual
Languages:
English French German Hebrew Russian
Availability:
Freely Available
License:
Apache License, Version 2.0
Size:
62 MByte Production Status:
Newly created-in progress
Use:
Syntactic Evaluation (and Evaluation Set Generators)
-
Paper title:Cross-Linguistic Syntactic Evaluation of Word Prediction Models
-
Paper track:Long/Interpretability and Analysis of Models for NLP
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Aaron Mueller | CLAMS: Cross-Linguistic Assessment of Models on Syntax | /N |
Documentation:
README.md on Github repository in English
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
From Data Center(s)
License:
Creative Commons Attribution - Pas d'Utilisation Commerciale - Partage dans les Mêmes Conditions
Size:
146 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | Traitement des Corpus Oraux en Français (TCOF) | /N |
Documentation:
article DOI: 10.4000/pratiques.1597, french, public
Speech/Written
Corpus,
Language Type:
Bilingual
Languages:
French Italian
Availability:
From Data Center(s)
License:
ELRA non commercial use, ELRA commercial use, ELRA evaluation use
Size:
90.5 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | PORTMEDIA | /N |
Documentation:
Article link: https://www.researchgate.net/publication/225285476_Robustesse_et_portabilites_multilingue_et_multi-domaines_des_systemes_de_comprehension_de_la_parole_le_projet_PortMedia, french, public
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
From Data Center(s)
License:
Licence Creative Commons Attribution - Pas d'Utilisation Commerciale - Partage dans les Mêmes Conditions 4.0 International
Size:
78 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | Multicultural Paris French (MPF) | /N |
Documentation:
https://journals.openedition.org/corpus/3049#tocto2n4, french, public
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English French German Italian Polish Portuguese Spanish
Availability:
Freely Available
License:
CC BY 4.0
Size:
None Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | Multilingual LibriSpeech (MLS) | /N |
Documentation:
https://arxiv.org/abs/2012.03411, English, public
Speech
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
From Owner
License:
academic only, no commercial use
Size:
0.9 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | GEneva Multimodal Expression Portrayals (GEMEP) | /N |
Documentation:
https://www.unige.ch/cisa/gemep, https://www.researchgate.net/publication/51796867_Introducing_the_Geneva_Multimodal_Expression_Corpus_for_Experimental_Research_on_Emotion_Perception, English, public
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
From Data Center(s)
License:
Creative Commons Attribution-Noncommercial-Share Alike 3.0 License
Size:
59 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | Corpus de Français Parlé Parisien des années 2000 (CFPP2000) | /N |
Documentation:
http://cfpp2000.univ-paris3.fr/, http://cfpp2000.univ-paris3.fr/CFPP2000.pdf, French, Public
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0)
Size:
1 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | CAnadian French Emotional speech dataset (CaFE) | /N |
Documentation:
DOI: 10.1145/3204949.3208121, English, public
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
License:
Creative Commons (BY/NC/ND) Licence
Size:
30 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | Att-HACK | /N |
Documentation:
DOI: 10.21437/SpeechProsody.2020-152, English, public
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
French
Availability:
Freely Available
License:
Apache 2.0
Size:
22 hoursProduction Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | African Accented French | /N |
Documentation:
https://www.openslr.org/57/, https://www.army.mil/article/159362/Nigerien__Malian_soldiers_aid_US_Army_s_language_translation_technology, English, public




